Recursive weighted partial least squares (rPLS): an efficient variable selection method using PLS

نویسندگان

  • Åsmund Rinnan
  • Martin Andersson
  • Carsten Ridder
  • Søren Balling Engelsen
چکیده

Variable selection is important in fine tuning partial least squares (PLS) regression models. This study introduces a novel variable weighting method for PLS regression where the univariate response variable y is used to guide the variable weighting in a recursive manner—the method is called recursive weighted PLS or just rPLS. The method iteratively reweights the variables using the regression coefficients calculated by PLS. The use of the regression vector to make up the weights is a reasonable idea from the fact that the weights in the regression vector ideally reflect the importance of the variables. In contrast to many other variable selection methods, the rPLS method has the advantage that only one parameter needs to be estimated: the number of latent factors used in the PLS model. The rPLS model has the fascinating output that it, under normal conditions, converges to a very limited number of variables (useful for interpretation), but it will exhibit optimal regression performance before convergence, normally including covarying neighbor variables. This study examines the properties of rPLS by application to a near-infrared spectroscopy dataset of feed samples predicting the protein content and to a metabolomics dataset modeling a reference metabolic parameter (creatinine) from nuclear magnetic resonance spectra of human urine. Copyright © 2013 John Wiley & Sons, Ltd.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An lnterpretation of Partial Least Squares

Univariate partial least squares (PLS) is a method of modeling relationships between a Y variable and other explanatory vanables. It may be used with any number of explanatory variables, even far more than the number of observations. A simple interpretation is given that shows the method to be a straightforward and reasonable way of forming prediction equations. Its relationship to multivariate...

متن کامل

On-line Sequential Extreme Learning Machine Based on Recursive Partial Least Squares

This paper proposes the online sequential extreme learning machine algorithm based on the recursive partial leastsquares method (OS-ELM-RPLS). It is an improvement to the online sequential extreme learning machine based on recursive least-squares (OS-ELM-RLS) introduced in [1]. Like in the batch extreme learning machine (ELM), in OSELM-RLS the input weights of a single-hidden layer feedforward ...

متن کامل

Weighted Averaging Partial Least Squares regression ( WA - PLS ) : definition and comparison with other methods for species - environment calibration

(1993). Weighted averaging partial least squares regression (WA-PLS): definition and comparison with other methods for species-environment calibration. ABSTRACT A multivariate calibration method is proposed that relates to correspondence analysis as partial least squares (PLS) relates to principal components analysis. The new method extends and improves the weighted averaging (WA) method for in...

متن کامل

Evaluation of the Bitterness of Traditional Chinese Medicines using an E-Tongue Coupled with a Robust Partial Least Squares Regression Method

To accurately, safely, and efficiently evaluate the bitterness of Traditional Chinese Medicines (TCMs), a robust predictor was developed using robust partial least squares (RPLS) regression method based on data obtained from an electronic tongue (e-tongue) system. The data quality was verified by the Grubb's test. Moreover, potential outliers were detected based on both the standardized residua...

متن کامل

Soft Sensor Model Maintenance: A Case Study in Industrial Processes

One of the challenges of utilizing soft sensors is that their prediction accuracy deteriorates with time due to multiple factors, including changes in operating conditions. Once soft sensors are designed, a mechanism to maintain or update these models is highly desirable in industry. This paper proposes an index that can monitor the prediction performance of soft sensor models and provide guida...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014